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Abstract 

Background: The Nile tilapia {Oreochromis niloticus) is the second most farmed fish species worldwide. It is also an 
important model for studies offish physiology, particularly because of its broad tolerance to an array of 
environments. It is a good model to study evolutionary mechanisms in vertebrates, because of its close relationship 
to haplochromine cichlids, which have undergone rapid speciation in East Africa. The existing genomic resources 
for Nile tilapia include a genetic map, BAC end sequences and ESTs, but comparative genome analysis and maps of 
quantitative trait loci (QTL) are still limited. 

Results: We have constructed a high-resolution radiation hybrid (RH) panel for the Nile tilapia and genotyped 1358 
markers consisting of 850 genes, 82 markers corresponding to BAC end sequences, 154 microsatellites and 272 
single nucleotide polymorphisms (SNPs). From these, 1296 markers could be associated in 81 RH groups, while 62 
were not linked. The total size of the RH map is 34,084 cR 350 o and 937,310 kb. It covers 88% of the entire genome 
with an estimated inter-marker distance of 742 Kb. Mapping of microsatellites enabled integration to the genetic 
map. We have merged LG8 and LG24 into a single linkage group, and confirmed that LG16-LG21 are also merged. 
The orientation and association of RH groups to each chromosome and LG was confirmed by chromosomal in situ 
hybridizations (FISH) of 55 BACs. Fifty RH groups were localized on the 22 chromosomes while 31 remained small 
orphan groups. Synteny relationships were determined between Nile tilapia, stickleback, medaka and pufferfish. 

Conclusion: The RH map and associated FISH map provide a valuable gene-ordered resource for gene mapping 
and QTL studies. All genetic linkage groups with their corresponding RH groups now have a corresponding 
chromosome which can be identified in the karyotype. Placement of conserved segments indicated that multiple 
inter-chromosomal rearrangements have occurred between Nile tilapia and the other model fishes. These maps 
represent a valuable resource for organizing the forthcoming genome sequence of Nile tilapia, and provide a 
foundation for evolutionary studies of East African cichlid fishes. 
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Background 

Tilapia is a common name for a large number of species 
belonging to the order Perciformes which accounts for 
one fourth of all vertebrate species. They are members 
of the family Cichlidae which consists of more than 
3000 species distributed across tropical and subtropical 
regions. Tilapia are currently the second most farmed 
fish in the world with an annual production exceeding 
2.8 million tons in 2010 [1]. Tilapia are a valuable source 
of protein for developing and emerging countries, but it 
is now also a prime fish commodity in developed coun- 
tries. Apart from their domestic importance, there is a 
wealth of studies on different aspects of tilapia biology, 
e.g. on their physiology, endocrinology, immunology, 
toxicology and genetics. Tilapia have a short generation 
time, are sufficiently large in size for physiological stud- 
ies and can be easily reared making them a perfect 
model system. They exhibit a versatile adaptability to 
different environmental conditions to match the vast 
array of their ecological habitats. They can tolerate in- 
credible variations in temperature (12 to 43°C), pH (6 to 
10), salinity (0 to 135 g/L), and oxygen levels (0.3 to 
1.5 mg/L [2-5]). Therefore, they constitute exquisite 
models for environmental genomics, to analyse the 
interactions between the genome and the environment, 
and the adaptive responses to environmental stresses [6] . 
Because tilapia are closely related to the cichlid fishes in 
the Great Lakes of East Africa, which have undergone a 
spectacular radiation, they will contribute to our under- 
standing of evolutionary mechanisms. The 2000 cichlid 
species in these lakes represent a collection of natural 
mutants that may provide insight into the genetic 
mechanisms of speciation and adaptation [7]. These 
unique biological features have motivated the develop- 
ment of a range of genomic tools for the Nile tilapia, 
Oreochromis niloticus, one of the most farmed tilapia 
species. An extensive collection of ESTs was recently 
constructed to aid the annotation of the forthcoming 
Nile tilapia genome and for gene expression studies [8]. 
Likewise the analysis of 106,259 BAC end sequences and 
their alignment on the genome sequence of four model 
fish species (stickleback, medaka, pufferfish and zebra- 
fish) provides a valuable intermediate resource for the 
mapping of genes in cichlids [9]. The culmination of 
these efforts is the whole genome sequence currently 
being assembled by the Broad Institute (Cambridge, 
USA). 

There are several economic traits in tilapia such as 
growth [10] and sex-ratio [11] that need improvement 
and require genetic markers for their selection. Likewise, 
identification of QTLs (Quantitative Trait Loci) for other 
economic traits are being performed in tilapia [12] as 
well as for immune responses [13]. Two Nile tilapia gen- 
etic maps were constructed, for QTL mapping and for 



selection purposes, of which the latest contains 538 
microsatellites and 21 gene markers [14,15]. These gen- 
etic maps established 24 linkage groups although the til- 
apia karyotype is composed of just 22 chromosome pairs 
[16]. These genetic maps provided a first characterization 
of the tilapia genomes. Because only a few gene-based 
markers were mapped, synteny relationships with model 
fish species were only possible at low resolution. The map 
has been updated with a few more gene-based markers 
but the number of comparative markers remains limited 
[17,18]. 

Radiation hybrid (RH) mapping is suited to mapping 
all types of markers including gene-based markers, and 
can order them at high resolution. RH maps can inte- 
grate genetic maps through the mapping of polymorphic 
markers, as well as construct comparative maps through 
the mapping of non polymorphic markers (orthologous 
genes). In zebrafish the RH map allowed much higher 
gene-marker coverage of the genome and permitted 
comparisons with other vertebrates [19,20]. These 
advantages are particularly evident for fish species lack- 
ing genome tools, such as sea bass, where a gene-based 
RH map enabled comparisons with the genomes of 
stickleback, pufferfish, medaka and zebrafish [21]. Like- 
wise, the two RH maps constructed for the seabream 
allowed comparisons with the pufferfish genome se- 
quence [22,23]. Global synteny relationships were also 
established between three farmed Perciformes (seab- 
ream, European seabass and Nile tilapia) and with the 
model species (stickleback, medaka and pufferfish) [24]. 

High-resolution RH maps are also of great help in 
scaffolding genome sequences developed in shotgun pro- 
jects [25,26]. In many cases draft and even "finished" 
genome sequences from shotgun projects contain large 
sequence gaps that imply inconsistencies in the place- 
ment of scaffolds. In addition, low in-depth sequences 
lack long-range continuity and provide only a fragmen- 
ted view of a genome. This was precisely the case for the 
fugu genome sequence, which consists of 7213 uncon- 
nected scaffolds without any chromosome assignment 
[27]. The construction of RH panels for fish species has 
not been an easy task. To date, only four fish RH panels 
have been reported. Two zebrafish RH panels were 
derived from permanent cell lines [20,28]. More recently, 
a seabream RH panel was constructed from primary 
fibroblasts [22] and the European seabass RH panel was 
derived from splenocytes [21], thus avoiding the pro- 
blems of genome rearrangements that arise in cell lines. 

Here we describe the construction of a Nile tilapia RH 
panel derived from fresh splenocytes and a gene-rich RH 
map of 1358 markers. The RH map was integrated with 
the Nile tilapia karyotype by FISH analysis. This allowed 
us to assign the RH groups to the 22 chromosomes as well 
as to identify their orientation with respect to the 
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centromere. The mapping of 154 microsatellites permitted 
the anchoring of the genetic map to the RH map. 
Amongst the different markers selected for the Nile tilapia 
RH map there was a large proportion related to growth 
and reproduction. In addition, a large number of SNPs 
identified in individuals from the 10 th generation of the 
widely cultured GIFT (Genetically Improved Farmed Til- 
apia) strain were included in the RH map (van Bers at al, 
submitted to Molecular Ecology Resources). 

Results and Discussion 

RH panel 

We fused Hprt~ derivative CHO host cells with Nile til- 
apia splenocytes y-irradiated at 3500 rad. Hybrid cells 
maintaining Nile tilapia chromosome fragments were 
selected based on their growth in HAT medium. A total 
of 381 hybrid cell lines were obtained through 3 fusion 
experiments. The retention frequency (i.e. the estimated 
percent of markers per clone) was determined for every 
clone by typing a set of 48 microsatellites selected from 
the genetic map [15]. We selected 190 hybrid cell lines 
on the basis of their retention frequency and their gen- 
ome representation (Figure 1). Further typing of 56 add- 
itional markers on this 190 cell line panel led to a 



cumulative retention frequency of 11.7 %. This rather 
low retention value is compensated by the unusually 
high number of hybrid cell lines, which allowed us to 
substantially increase the number of genotyping data. 
The Nile tilapia RH panel was therefore constructed 
from fresh live cells that required no primary culture, an 
important condition to avoid genome rearrangements 
typical of permanent cell lines. Splenocytes are conveni- 
ent to use, as they are abundant, their dissociation is 
easy and can be performed in a relatively short time pre- 
ceding the irradiation step. 

Marker selection 

A total of 16,195 Nile tilapia expressed sequence tags 
(ESTs) were collected, consisting of 5161 sequences from 
the CIRAD, 3537 sequences from the NCBI nucleotide 
database and 7497 sequences from the RBEST database 
(October 2008). A proportion of 3.5 % of these sequences 
were identified as simple repeats and masked by the 
RepeatMasker program. Sequence alignment with the 
CAP3 software resulted in 1476 contigs and 5692 singlets 
i.e. 7168 unique sequences putatively corresponding to as 
many genes (Additional file 1: data SI). These 7168 
sequences were aligned onto the stickleback, pufferfish, 




161 181 201 221 
Hybrid cell lines 



381 



Figure 1 Retention frequency of the Nile tilapia hybrid cell lines. The hybrid cell lines are numbered from 1 to 381 on the X axis. Presence/ 
absence of 48 microsatellite markers spread all over the tilapia genome [15] was estimated by PCR determination. Their retention frequency per 
clone is presented on the Y axis. The 190 hybrid cell lines selected on quantitative and qualitative criteria that constitutes the tilapia RH panel are 
in green. 
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medaka and zebrafish genome sequences using the Exon- 
erate software. A minimum score of 250, corresponding 
to a minimal alignment length of 50 bp, was applied fol- 
lowing the usual recommendations to map orthologous 
ESTs [29]. A maximal alignment size of 300 bp was 
imposed to avoid hits that may in fact correspond to ret- 
rogenes rather than orthologs (retrogenes as opposed to 
pseudogenes which are characterized by non-sense or 
frame shift mutations). According to these criteria, 2475 
of the unique sequences had a hit with at least one of the 
model genomes: 1920 (77.6%) had a hit with stickleback, 
1836 (74.3%) with medaka, 1715 (69.3%) with pufferfish 
and 1304 (52.7%) with zebrafish (Figure 2). A total of 942 
Nile tilapia sequences were conserved across all four 
model species, while 224 Nile tilapia sequences were con- 
served exclusively with stickleback, 157 with medaka, 117 
with pufferfish and 90 with zebrafish. 

We preferentially designed gene markers from Nile til- 
apia ESTs having the highest conservation with the 
stickleback. Although Nile tilapia is phylogenetically 
closer to medaka [30], the stickleback genome appears 
to be the best reference sequence because it is assembled 
with highest confidence [9]. Each marker was designed 
from the aligned region having the best homology (high- 
est score) with the reference genome. This strategy mini- 
mized the possibility of an intron lying between the 
oligonucleotides. The Nile tilapia Illumina BeadArray 
contained 1536 markers consisting of 1300 genes, 97 
BACs and 139 microsatellites. Sequence analysis of a 
Nile tilapia reduced representation library (RRL) resulted 
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Figure 2 Venn diagram representing the distribution of 
markers shared by Nile tilapia and stickleback/medaka/ 
pufferfish/zebrafish. Each model species is represented by an 
ellipse. Number of markers shared by two species or more are 
indicated in every intersection. For each model species, the number 
of markers and the percentage of the 2475 Nile tilapia markers are 
indicated. 



in the detection of 3569 SNPs. Of these, a subset of 384 
SNPs was genotyped on the RH panel. 

Genotyping 

Out of the initial 1536 selected markers and the subse- 
quent 384 SNP markers genotyped on the RH panel, 
1026 and 272 markers respectively showed an exploit- 
able profile and were selected to construct the RH map. 
Roughly one third of the markers had to be removed 
from the initial set, as their calls could not be separated 
in two distinct clusters of presence and absence on the 
graphic representation. This ratio of failure is high, but 
is in the range of what was observed for our previous 
RH map constructions using PCR genotyping (dog, seab- 
ream and seabass). With this strategy a similar propor- 
tion of designed pairs of oligonucleotides either do not 
properly amplify the test DNA or amplify both the test 
DNA and the carrier hamster cell DNA and thus are not 
useful for RH map construction. The current Nile tilapia 
map was constructed with a final set of 1358 markers 
consisting of 850 genes, 154 microsatellites, 82 BACs 
and 272 SNP, including 60 markers genotyped by PCR 
(see the vector suite in Additional file 2: data S2). 

RH group characteristics 

The two-point analysis of the 1358 markers using the 
Multimap software started at a lod score of 4.0, which 
was then increased in a step-wise fashion up to a thresh- 
old of lod 7.0. The final map consists of 81 RH groups 
containing between 2 to 89 markers each for a total of 
1296 markers. Sixty- two markers remained unlinked. 
The 1296 markers are spread in 1255 positions of which 
1220 positions consist of a single marker, 32 positions 
contain two co-localized markers and three positions 
contain 3 to 6 co-localized markers. Multipoint analysis 
was carried out with CarthaGene software that ordered 
markers within each RH group and determined their dis- 
tances expressed in centirays (cR 350 o)- RH groups ranged 
in size from 5 to 1906 cR 350a Inter-marker distances 
vary between 1 and 164 cR 350 o with an average of 27 
cR 35 oo. Characteristics of the RH groups in terms of size 
and marker content are presented in Additional file 3: 
S3. Assuming a Nile tilapia genome size of 1060 Mb [31] 
the mapping of 1358 markers corresponds to a density 
of 1.28 marker per megabase or one marker per 780 kb, 
when considering an even distribution of the markers. 

The relationship between cR 350 o and kilobase (kb) can 
be estimated from the ratio of RH to genetic distances 
(cR/cM) knowing the ratio between cM and kb. To this 
end, we identified 82 pairs of microsatellites separated by 
a known distance measured in genetic and RH units. The 
cumulated distance of these 82 couples is 18,446 cR 350 o 
on the RH map and 604 cM on the genetic map i.e. a ratio 
of 30.5 cR 350 o/cM. The size of the Nile genome being 
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1060 Mb [31] and the size of the genetic map being 
1311 cM the ratio was estimated to 840 kb/cM [15]. 
Therefore, the relation of physical unit to RH unit is esti- 
mated to be 27.5 kb/cR 350 o. The calculated size in kb of 
the RH map is 937,310 kb (34,084 cR x 27.5) and thus cor- 
responds to a coverage of 88% of the entire genome size. 
Considering that -96% of the markers (1298 markers out 
of 1358) were mapped in the 81 RH groups, one can esti- 
mate the probability of mapping a novel marker of interest 
in one of the existing RH group to be 96 %. This figure 
can most likely be considered as a better estimate of the 
coverage of the Nile RH map. 

Because most of the markers designed for the Nile til- 
apia RH map were from ESTs, the genome regions that 
remain uncovered may correspond to gene-poor regions 
such as heterochromatin or regions containing genes 
that are poorly expressed. 

Integration of RH map and FISH data 

The FISH mapping of BAC clones analyzed two by two 
allowed us (i) to assign RH groups to specific chromo- 
somes with higher confidence, (ii) to orient them relative 
to each other and (iii) to localize centromeric and telo- 
meric ends on the chromosome maps. They also served 
as a validation of linkage group assignment. In addition 
to ordering based on FISH mapping, the RH groups 
were tentatively ordered and orientated on the basis of 
the two-point analyses between markers close to RH 
group extremities and on the basis of the genetic map. 
The chromosome map of LG7 featuring RH and FISH 
maps along with the genetic map is presented in Figure 3. 
All chromosome maps are available in Additional file 4: 
data S4 and online (http://www.BouillaBase.org). Table 1 
presents chromosome characteristics in terms of number 
of markers and RH groups. We first selected 42 BAC 
clones that contained markers assigned to 33 RH. A sec- 
ond set of 48 BAC clones was selected from regions of 
interest based on synteny with reference species, obtained 
from the BouillaBase server. 

Of the 90 initially selected BAC clones, FISH data 
from 45 BACs were not taken into account. They corres- 
pond to BACs producing many signals on different 
chromosome pairs (chimeric BACs or BACs that hybridize 
to duplicated regions), and/or to the observation of strong 
background signals in spite of the use of competitor and 
carrier DNA. They were deleted from the analysis and in 
many circumstances replaced by other BACs from the 
same linkage group. Successful hybridization results were 
those in which the two probes for the same LG group gave 
a clear signal and could be repeatedly observed in at least 
10 metaphase spreads. Thus a total of 55 BAC markers 
(Table 2) were successfully hybridized and mapped in 39 
RH groups. This allowed all of the main RH groups 
(n > 30 markers) to have at least one BAC mapped by RH 



and FISH. For 17 chromosomes, two to four BACs were 
used to assemble several RH groups onto a single chromo- 
some (Table 2). This was the case for instance, of LG7, 
constituted by three RH groups (Figure 3). Each of the 22 
chromosomes can now be identified with the help of one 
to four fluorescent probes. This is a particularly important 
result because, with the exception of the largest chromo- 
some pair (Chrl/LG3), which is three times larger than 
any other Nile tilapia chromosomes and of the second 
chromosome pair (Chr2/LG7), none of the other chromo- 
somes can be easily distinguished using classical cytogen- 
etic techniques, due to similarity in size and fluctuations 
in the chromatin condensation [32,33]. These BAC 
chromosome markers can also be used to identify ortholo- 
gous chromosomal regions among closely related species 
within the Tilapia group (such as other Oreochromis spp., 
or Sarotherodon). They can also be used to provide 
insights on the evolution of chromosome regions that 
have taken place since the divergence of tilapia and other 
cichlids from their ancestors. The list of clones, their refer- 
ence and chromosome assignation is presented in Table 2. 

All the BAC probes hybridized to the long arm of the 
chromosomes (such as LG7, shown in Figure 3) with the 
exception of LG15. This LG is composed of two RH 
groups and the BAC probes taken from each RH group 
both hybridized to the short arm of the chromosome. 
Chromosome LG15 is a small submetacentric chromo- 
some in which the small arm is often clearly visible. It 
very probably corresponds to the chromosome 6 as 
defined by Ferreira et al. [34]. 

Integration of RH and genetic maps 

The published genetic map of Nile tilapia was con- 
structed with 545 microsatellite markers and 20 gene 
markers present on 24 linkage groups (LG) [15]. The in- 
tegration of the RH map onto this genetic map was 
established using the 132 microsatellites present in both 
maps. Hence, fifty of the 81 RH groups were connected 
to the 24 genetic linkage groups, placing them onto 22 
chromosome maps which contained on average 2.3 RH 
groups per chromosome. These 50 RH groups totalize 
1123 map positions containing 1161 markers, which rep- 
resent 89.4% of the markers located on the map. The 
remaining 31 small RH groups containing two to nine 
markers totalize 137 markers, which correspond to 
10.6% of the markers on the map. They cannot be 
assigned to any chromosomes presently. These groups 
ranged in size from 5 to 345 cR 350 o. 

In the RH map we were able to associate two small 
LGs, LG8 and LG24 into a single chromosome by RH 
mapping of four microsatellite markers (GM027 and 
UNH129 from LG8, GM104 and GM173 from LG24). 
We also merged and confirmed by FISH analysis the 
previously grouped LG16 and LG21 into a unique 
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Figure 3 (A) Integrated genetic-RH-FISH map of the tilapia chromosome LG7. The RH map on the middle consists of three RH groups 
containing ordered markers whose coordinates are indicated in cR 350 o- Microsatellites (in blue) allowed the anchorage of the RH map to the 
genetic map [15] figured by a vertical bar on the left. Double-FISH of BAC clones highlighted by a red or green frame indicate the relative 
position of the RH groups on the chromosome symbolized on the right side. The chromosome is orientated with its centromere up. (B) Results 
of Double-FISH experiment of BAC clone WG0AAA35YD23HM1 revealed with FITC (green) and BAC clone WG0AAA16YE01 HM1 revealed with 
Rhodamin (red) on a chromosome preparation. (C) Results of Double-FISH experiment of BAC clone WG0AAA35YD23HM1 revealed with FITC 
(green) and BAC clone WG0ACA29YJ13M1 revealed with Rhodamin (red) on a chromosome preparation. 



chromosome [16]. LG2, LG11, LG12, LG14 and LG23 
correspond each to a single RH group whereas 15 LGs 
are made of two to five RH groups (see table 1). Conse- 
quently we have been able to locate the 24 linkage 
groups and placed them onto 22 chromosome pairs. For 
simplicity, we named the chromosomes maps after the 
genetic linkage groups (LG) of the Nile tilapia genome 
[15]. 

Overall the microsatellites in LG11, LG12, LG14 or 
LG23 are in the same order in the RH and genetic maps 
except for small local inversions that may be due to vec- 
tor quality (see the data computation chapter) in one of 



the mapping methodologies. However, a larger discrep- 
ancy was observed in the lower part of LG14. This RH 
group was tentatively broken at higher lod scores (up to 
7.0) but no reordering of the resulting groups was con- 
sistent with the genetic map. Given the high lod score to 
which this group stayed unbroken we believe that the 
correct order is that of the RH map. 

Comparative genomics 

Synteny relationships were established from the markers 
of the assigned RH groups having localized orthologous 
genes in the sequences of model species. Orthologs 
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Table 1 Characteristics of Nile tilapia chromosome maps 
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localized in the "chromosome unknown" of model spe- 
cies were not taken into account. Of the 1123 mapped 
positions in the assigned RH groups, 277 markers 
allowed anchorage of the Nile tilapia genome with all 
four model fish species, 268 with three species, 165 with 
two species and 78 with one species representing a total 
of 788 orthologous markers providing 2320 anchors. 
Synteny relationships identified by two or more consecu- 
tive conserved markers defined a conserved segment 
(CS) while a single marker identified a singleton [35] 
[36]. 

The Oxford grid shown in Figure 4A recapitulates the 
CS found between Nile tilapia and stickleback. The Nile 
tilapia RH map and the stickleback genome sequence 
were connected by 713 anchors defining 23 CS. Seven- 
teen CS correspond to entire chromosomes in which 
synteny is perfectly conserved between the two species. 
The Nile tilapia chromosome LG7 is made of two CS 
corresponding to stickleback chromosomes GAC14 and 
GAC19. Conversely, Nile tilapia chromosomes LG2 and 



LG17 consist of one CS each that are fused in stickle- 
back to form chromosome GAC04. Furthermore, Nile 
tilapia chromosomes LG3 and LG10 both correspond to 
stickleback chromosome GAC07. This pattern of syn- 
teny would imply at least three inter-chromosomal rear- 
rangements between the two lineages. The presence of 
two interstitial telomeric signals in Nile tilapia LG3 
[32,33] suggests that this chromosome arose by two 
fusions. It has been suggested that these occurred within 
the cichlid lineage [34] but the stickleback-tilapia syn- 
teny (LG3 - GAC07) may suggest that these are older. 

A total of 623 anchors identified 24 CS connecting the 
Nile tilapia RH map and the medaka genome sequence 
(Figure 4B). Synteny is entirely conserved between 20 
chromosomes of the two species. As with stickleback, 
the Nile tilapia chromosome LG7 is made of two CS 
corresponding to medaka chromosomes OLA06 and 
OLA12. The Nile tilapia chromosome LG15 is also made 
of two CS, a large one corresponding to medaka 
chromosome OLA24 and a small one corresponding to 
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Table 2 BAC Markers positioned by RH mapping and FISH analysis 



BAC 




BAC Markers 


Genoscope Name 


384 Name 


FPC Name 


LG1 


WG0AAA14YI14RM1 


WGOAAA 14Y1 14 


b03TI048N4 


b03TI048BE07 




WG0AAA46YC14 


WG0AAA46YC14 


b03TI090C14 


b03TI090BB07 




WG0AAA1 3YF01 HM1 


WGOAAA 13YF01 


b03TI047F01 


b03TI047CC01 




WG0AAA42YA07HM1 


WG0AAA42YA07 


b03TI086A07 


b03TI086AA04 


LG2 


WG0AAA30YG 1 9H M 1 _ATRX 


WG0AAA30YG19 


b03TI074G19 


b03TI074AD10 




WG0AAA2YH 1 8HM 1 _FGF24 


WG0AAA2YH 1 8 


b03TI032H18 


b03TI032DD09 


LG3 


WG0AAA13YB1 1RM1 


WGOAAA 13YB1 1 


b03TI047B1 1 


b03TI047CA06 




WG0AAA36YM24RM 1 


WG0AAA36YM24 


b03TI080M24 


b03TI080BG12 


LG4 


WG0AAA1 1YA12 


WGOAAA 1 1YA12 


b03TI045A12 


b03TI045BA06 




WG0AAA22YF1 1 HM1_LHX9 


WG0AAA22YF1 1 


b03TI066F1 1 


b03TI066CC06 


LG5 


WG0ACA44YI02 


WG0ACA44YI02 


b03TI060l02 


b04TI060BE01 




WG0AAA44YK23HM 1 _BAP 


WG0AAA44YK23 


b03TI088K23 


b03TI088AF12 




WG0AAA44YP 1 9RM 1 _LTPR 1 


WG0AAA44YP19 


b03TI088P19 


b03TI088CH10 




WG0AAA22YB14M1 


WG0AAA22YB14 


b03TI066B14 


b03TI066DA07 


LG6 


WG0AAA45YN 1 8RM1 _TP53BP2 


WG0AAA45YN 1 8 


b03TI089N18 


b03TI089DG09 




WG0AAA34YG21 HM1_MCM5 


WG0AAA34YG21 


b03TI078G21 


b03TI078AD1 1 




WG0AAA3 1 YE 1 6HM 1 _ALD0A 


WG0AAA31YE16 


b03TI075E16 


b03TI075BC08 


LG7 


WG0ACA29YJ13M1 


WG0ACA29YJ 1 3 


b04TI039J13 


b04TI039CE07 




WGO A A A 1 6YE0 1 H M 1 _RERG 


WGOAAA 16YE01 


b03TI050E01 


b03TI050AC01 




WG0AAA35YD23HM1_KCNE1 L 


WG0AAA35YD23 


b03TI079D23 


b03TI079CB12 


LG8-24 


WG0AAA42YA01 RM1_KIR2.1_A 


WG0AAA42YA01 


b03TI086A01 


b03TI086AA01 




WG0AAA33YD12 


WG0AAA33YD12 


b03TI077D12 


b03TI077DB06 




WG0AAA46YD19 


WG0AAA46YD19 


b03TI090D19 


b03TI090CB10 


LG9 


WG0AAA28YB24HM1_NPPC 


WG0AAA28YB24 


b03TI072B24 


b03TI072DA12 




WG0AAA13YJ04M1 


WGOAAA 13YJ04 


b03TI047J04 


b03TI047DE02 


LG10 


WG0AAA38YC08HM 1 _TGFB3 


WG0AAA38YC08 


b03TI082C08 


b03TI082BB04 




WG0AAA2YB24HM 1 _LOC485593 


WG0AAA2YB24 


b03TI032B24 


b03TI032DA12 


LG11 


WG0AAA42YO20RM 1 _TGFB2R 


WG0AAA42YO20 


b03TI086O20 


b03TI086BH10 




WGOAAA 1 6YH 1 7HM1 _DLX3 


WGOAAA 16YH1 7 


b03TI050H17 


b03TI050CD09 




WG0ACA24YM03 


WG0ACA24YM03 


b04Ti034M03 


b04TI034AG02 


LG12 


WG0AAA1 6YK1 8HM 1 _LIM6 


WG0AAA16YK18 


b03TI050K18 


b03TI050BF09 




WG0ACA19YO21M1 


WG0ACA19YO21 


b04TI 029021 


b04TI029AH1 1 


LG13 


WGO A A A4 1 YB 1 5 RM 1 


WG0AAA41YB15 


b03TI085B15 


b03TI085CA08 




WG0ACA52YD05 


WG0ACA52YD05 


b04TI078d05 


b04TI078CB03 




WG0AAA35YG 16HM1 _CLIC4 


WG0AAA35YG16 


b03TI079G16 


b03TI079BD08 


LG14 


WG0AAA4YJ07HM1 _DMRT1 Y 


WG0AAA4YJ07 


b03TI034J07 


b03TI034CE04 




WG0AAA30YO 18HM1 _CLDN 1 3 


WG0AAA30YO18 


b03TI 074018 


b03TI074BH09 


LG15 


WG0AAA47YB05M1 


WG0AAA47YB05 


b03TI091B05 


b03TI091CA03 




WGO A A A29YA 1 5 H M 1 _FSH B 


WG0AAA29YA15 


b03TI073A15 


b03TI073AA08 


LG16-21 


WG0ACA14YN04 


WG0ACA14YN04 


b04TI024N04 


b04TI024DG02 




WG0ACA24YI10M1 


WG0ACA24YI10 


b04TI034H0 


b04TI034BE05 




WG0AAA29YK07HM 1 _CLDN 1 0C 


WG0AAA29YK07 


b03TI073K07 


b03TI073AF04 




WG0AAA34YL09HM1 _GDF6 


WG0AAA34YL09 


b03TI078L09 


b03TI078CF05 


LG17 


WG0AAA28YI20RM 1 _BMP7 


WG0AAA28YI20 


b03TI072l20 


b03TI072BE10 




WGOAAA 1 YC03 RM 1 _AP R_3 


WGOAAA 1YC03 


b03TI031C03 


b03TI031AB02 


LG18 


WGOAAA 15YJ04M1 


WG0AAA15YJ04 


b03TI049J04 


b03TI049DE02 
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Table 2 BAC Markers positioned by RH mapping and FISH analysis (Continued) 





WG0AAA37YF19RM1_RAI2 


WG0AAA37YF19 


b03TI081F19 


b03TI081CC10 


LG19 


WG0AAA33YH 1 0RM 1 _NR5 A2 


WG0AAA33YH10 


b03TI077H10 


b03TI077DD05 




WG0AAA28YF 1 8HM1 _RA1 1 7 


WG0AAA28YF18 


b03TI072F18 


b03TI072DC09 


LG20 


WG0AAA32YO06RM 1 _G ATA5 


WG0AAA32YO06 


b03TI076O06 


b03TI076BH03 




WG0AAA30YN 1 2HM1 _CCA1 


WG0AAA30YN12 


b03TI074N12 


b03TI074DG06 


LG22 


WG0AAA30YF08HM 1 _TGIF2LX 


WG0AAA30YF08 


b03TI074F08 


b03TI074DC04 




WG0AAA12YB12RM1_LFI2 


WG0AAA12YB12 


b03TI046B12 


b03TI046DA06 


LG23 


WG0AAA49YP 1 9M 1 


WG0AAA49YP19 


b03TI093P19 


b03TI093CH10 




WG0AAA16YK10M1 


WG0AAA16YK10 


b03TI050K10 


b03TI050BF05 



medaka chromosome OLA22. This pattern implies two 
inter-chromosomal rearrangements that would have oc- 
curred in one or the other lineage. Medaka chromosome 
OLA02 is the only chromosome with no Nile tilapia 
chromosome counterpart in the Oxford grid (Figure 4B). 
However three arguments suggest that LG23 is the miss- 
ing counterpart of medaka chromosome OLA02 in the 
Oxford grid: (a) a two-point analysis between end- 
markers links orphan group RH36 to LG23 (see Meth- 
ods section) (b) RH36 and medaka chromosome OLA02 
share five ortholog sequences (see Additional file 1: data 
SI) and (c) tilapia marker AF1 16240 located on LG23 
has an ortholog sequence on medaka chromosome 
OLA02. The grouping of RH36 and LG23 creates an 
additional synteny breakpoint with medaka as well as 
with stickleback and pufferfish. 

We identified 449 anchors connecting the Nile tilapia 
RH map and the pufferfish genome (Figure 4C) defining 
24 CS between these two species. Synteny appeared to- 
tally conserved between 14 chromosomes of the two 
species. Chromosome LG1 corresponds to two CS, a 
large one that corresponds to chromosome TNI05 and a 
small CS made of two markers, which corresponds to 
chromosome TNI13. The chromosome LG7 consists of 
two CS, one with chromosome TNI 13 and one with 
chromosome TNI04. Pufferfish chromosome TNI01 is 
made of two CS corresponding to LG2 and LG23 re- 
spectively. Chromosome TNI02 is also made of two CS 
with Nile tilapia chromosomes LG8-24 and LG16-21. A 
small additional segment of chromosome TNI 15 is con- 
served with Nile tilapia chromosome LG8-24. This pat- 
tern of conservation implies four inter-chromosomal 
events. Assignation of the orphan group RH36 to LG23 
as discussed above would create an additional CS and 
would imply another inter-chromosomal event. Add- 
itional CS exist but have not been identified yet such as 
in chromosome LG3 for which most of the pufferfish 
orthologs are located in the "chromosome unknown" file 
of the pufferfish assembly. 

Ten chromosomes show no synteny breakage across 
the four species (bolded in Figure 4A, 4B, 4C). Twenty 
five singletons were identified between Nile tilapia and 



stickleback, 44 between Nile tilapia and medaka and 38 
between Nile tilapia and pufferfish. These singletons 
suggest putative new CS but they also may be artefacts. 
Indeed the orthologous location of a given gene in a 
model species was defined as the best hit on the genome 
sequence of that species. However the best hit may in 
some instances have corresponded to a paralog espe- 
cially when the true ortholog has been lost as hypothe- 
sized by Soler et al. [9] to explain a possibly overestimated 
number of breakpoints. Consequently each singleton will 
have to be established as a new CS by the mapping of add- 
itional and informative markers. 

The number of CS appeared similar between Nile tilapia 
and each of the three reference models investigated in this 
work. This is in concordance with what was previously 
observed in the comparative map of the sea bass genome 
and the same fish models [21]. Finally, Nile tilapia and 
zebrafish were considered too distant phylogenetically to 
establish a pattern of chromosomal conservation despite 
the fact that 531 anchors were identified between the two 
species. 

On an intra-chromosomal scale, Conserved Segments 
Ordered (CSO) are regions in which the order of ortho- 
logous genes is perfectly conserved [35,36]. The simul- 
taneous comparison of several species allowed us to 
ascertain the extent and boundaries of shared CSO while 
also revealing the breakpoints that arose in some 
lineages. The Nile tilapia RH map aimed at identifying 
these CSO with stickleback, medaka and pufferfish in 
order to benefit from the comprehensive sequencing of 
these model genomes. Thus the location on the Nile til- 
apia genome of unmapped genes having a clear ortholo- 
gous relationship with genes of the model species could 
be hypothesized with high confidence. CSO between 
stickleback, medaka, pufferfish and Nile tilapia were 
identified using the AutoGRAPH web server and are 
presented in Table 3. Detailed CSO of LG7 are shown in 
Figure 5. Comparative maps of each of the 22 chromo- 
somes are presented in Additional file 5: data S5. 

We identified 90 CSO between Nile tilapia and stickle- 
back. Chromosomes LG14, LG9, LG10 and LG13 under- 
went few rearrangements with one to two CSO only (1, 
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Figure 4 Oxford grids between Nile tilapia and (A) stickleback, (B) medaka, (C) pufferfish. Chromosomes are named as follows : LG : Nile 
tilapia chromosomes; GAC : stickleback chromosomes; OLA : medaka chromosomes; TNI : pufferfish chromosomes. Conserved chromosomes or 
conserved segments are figured in black squares containing the number of orthologous markers that identify them. Other numbers in the grid 
indicate the number of singletons. Chromosomes showing no synteny breakage between the four species are bolded. 



2, 2, 2 respectively) while for LG4, LG18 and LG20, 
seven, six and seven CSO respectively were identified, 
showing evidence of considerable rearrangement. A total 
of 79 CSO were identified between Nile tilapia and me- 
daka. One CSO was detected in chromosomes LG2 and 
LG10 while LG15 and LG18 were the most rearranged. 
Only 57 CSO were detected between Nile tilapia and 
pufferfish (Table 3). 

A higher number of CSO was identified with stickle- 
back than with medaka and pufferfish. Considering that 
the phylogenetic position based on various parameters 
between Nile tilapia and the other model species indi- 
cates a closer relationship between Nile tilapia and me- 
daka one would rather have expected the contrary 



[30,37]. However, in terms of sequence similarity Nile 
tilapia is actually closer to stickleback than to medaka 
and pufferfish. This apparent discrepancy, if confirmed, 
would indicate that forces shaping the overall genome 
structure are different from those affecting gene function 
and evolution. Finally, from a technical standpoint, it is 
important to recall that the number of observed CSO 
shared by two species depends in part on the number of 
anchors used to establish the respective comparative 
maps. Indeed the higher the number of orthologous 
genes is, the more resolution the comparative map will 
have. Ideally in this study, the comparative maps should 
be based solely on 1:1:1:1 orthologs between Nile tilapia, 
stickleback, medaka and pufferfish. However only half of 
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Table 3 Syntheny relationships identified with the 
genomes of model fish species 
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CS: Conserved Segments. 

CSO: Conserved Segment Ordered. 

the markers satisfy this condition. Comparative maps 
based on this smaller number of markers have too little 
resolution to reach any conclusions. 

The sequence assembly status in some genomic regions 
of model species also prevents the identification of CSO 
with Nile tilapia. For example, assemblies of stickleback 
GAC17 (syntenic of Nile tilapia LG5), medaka OLA09 
(syntenic of LG12), pufferfish TNI04 (syntenic of LG7) or 
TNI19 (syntenic of LG17) are incomplete and lead to an 
underestimation of the number of CSO. Assuming a con- 
served gene order with the Nile tilapia the RH map would 
provide an opportunity to locate the unassigned contigs of 
model species. In this way the Nile tilapia RH map can be 
seen as a tool for improving the sequence assemblies of 
other fish species. 

Chromosomes LG10, LG14, LG9 and LG13 appear to 
be the least rearranged between Nile tilapia and the 
three model genomes. Conversely LG5, LG6 and LG18 
were the most rearranged. This observation suggests 
that the genome plasticity and the underlying evolution- 
ary constraints are not evenly distributed across the 
genome. 



Conclusions 

Through a spectacular decrease in cost and with the 
capability to generate more than hundred gigabases per 
week, the New Sequencing Technologies (NGS) have 
revolutionized the field of genomics over the last few 
years. It is now possible to obtain deep knowledge of the 
genomes of many more species than we could have 
dreamed of even ten years ago. However the main draw- 
back of NGS is the short length of their reads. Although 
steadily increasing, sequence reads are still very short 
(-100 nucleotides). This is not a problem when the goal 
is to re-sequence individuals and align the reads to a 
reference sequence. However this short size, even with a 
pair ends sequencing approach, renders the problem of 
de novo sequencing of large genomes difficult. Many of 
the novel assemblies produced with this approach are 
composed of a very large number of scaffolds [38]. This 
discontinuity does not affect gene discovery, polymorph- 
ism analysis and sequence comparison between closely 
related species but it greatly limits the study of the gen- 
ome structure and evolution. RH mapping and FISH 
mapping of markers present in different contigs and 
scaffolds allow to link them and deduce larger super 
scaffolds. 

Here we report the construction of a high-resolution 
RH map of Nile tilapia containing ESTs, genes, microsa- 
tellites and SNPs. The RH map has an estimated density 
of one marker every 780 kb. Fifty RH groups, which con- 
tained the vast majority of the markers (1161 out of 
1358), were assigned to the 24 previously known LGs, 
which in turn were located and oriented on the 22 Nile 
tilapia chromosomes through BAC multicolor FISH 
mapping on metaphase chromosomes. Already this RH 
map allows us to locate a large number of physiologic- 
ally important genes. For example, group RH17 located 
on chromosome LG15 contains the estrogen receptor 
gene together with gata4 and the follicle-stimulating 
hormone gene (fshb). This last gene has been shown to 
regulate the activity of Gata4, a transcription factor 
involved in ovarian function, by regulating the aromatase 
cypl9 gene [39]. Likewise, the growth hormone receptor 
gene ghrl (marker C456) was mapped in the group RH3 
assigned to chromosome LG12 and the growth hormone 
receptor gene ghr2 (marker C474) was mapped in the 
group RH9 assigned to chromosome LG7. 

The RH map associated to the FISH data also offers a 
detailed synteny analysis with three of the four model 
species (stickleback, medaka, pufferfish). Due to the 
great evolutionary distance separating Nile tilapia from 
zebrafish (>300MY), it was not possible to reach defini- 
tive conclusions about synteny between these species. 
Furthermore, it provides a foundation for studying 
karyotypic evolution in the flocks of haplochromine 
cichlids in East Africa, including the evolution of sex 
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Figure 5 Comparative map of the Nile tilapia chromosome LG7. Column 1 corresponds to marker names. All markers are gene-based 
markers except (a) those with prefix "MS" which correspond to microsatellites (in blue) taken from Lee et al. (2005), (b) those with prefix "WG0" 
which are BAC end markers (in red) and (c) those with prefix "SNP" which correspond to SNP-based markers (in green). Column 2 corresponds to 
marker coordinates expressed in centiRays (cR 350 o)- Following columns correspond to comparative data with, from left to right, stickleback, 
pufferfish, medaka, zebrafish. For every marker, chromosome numbers and coordinates of the putative orthologs in the genome sequences of the 
four model species are displayed. CSO between Nile tilapia and stickleback/medaka/pufferfish are figured in boxes. 
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chromosomes [40-42] and the origins of B chromosomes 
[43]. By contributing to the construction of a golden 
path for the Nile tilapia genome assembly, these maps 
will enable QTL and association mapping of adaptive 
traits in each of the haplochromine species flocks. 

The mapping of a number of SNPs derived from 20 
individuals of the 10 th generation of the widely cultured 
GIFT strain are included. To the best of our knowledge 
these SNPs are the first set of genome wide SNPs pub- 
licly available for Nile tilapia. SNPs are gaining popular- 
ity for use in e.g. parental assignment [44] and for the 
estimation of genetic parameters in tilapia breeding. The 
272 SNP markers were shown to allow the discrimin- 
ation between different strains and species of tilapia (van 
Bers at al, submitted Molecular Ecology Resources), 
and will be used in the near future to assess the genetic 
diversity of natural populations of Nile tilapia (Richard 
Crooijmans, personal comm.). The physical mapping 
position of these SNPs determined in this study allows 
the selection of unlinked SNPs for these future applica- 
tions. Finally the map will help to place and orientate on 
the Nile tilapia karyotype many of the scaffolds identified 
in the forthcoming tilapia genome sequence determined 
with the Alumina technology and assembled by the 
BROAD Institute. 

Methods 

Construction of a Nile tilapia radiation hybrid panel 

The RH map was constructed from a fully-inbred homo- 
zygous clonal line of O. niloticus consisting of all-female 
fish, generated at the University of Stirling (Scotland, 
UK). These fish were derived by gynogenesis from a 
strain originating from Lake Manzala (Egypt) [45]. A 
panel of radiation hybrid cell lines was constructed using 
the methodology described previously [22,46]. Briefly for 
each fusion, a splenocyte suspension was prepared using 
one clonal fish as described in Guyon et al. [21]. The 
suspension was y-irradiated at 3500 rad. Splenocytes 
were fused with Hprt~ derivative CHO cells in a 5: 1 
ratio (Splenocyte/CHO) in the presence of polyethylene 
glycol 1500 (Roche, Mannheim, Germany). Cells were 
seeded in 6-well microplates at a total concentration of 
150,000 cells per well and cultivated with HAT medium 
for 3 to 4 weeks until hybrid clone appearance. Each 
clone was recovered and further cultivated under HAT 
selection approximately one week in a 60 mm diameter 
Petri dish. After trypsinisation DNA was extracted from 
individual clones using the NucleoSpin Tissue kit 
(Macherey-Nagel, Duren, Germany). DNA concentration 
was estimated by fluorescence quantitation using Quant- 
iT Picogreen assay kit and a Qubit measuring device 
(Invitrogen, Carlsbad NM, USA). DNA extracts of 
hybrid cell lines were amplified by a Whole Genome 
Amplification (WGA) procedure when additional 



material was needed. In these cases, two separate WGA 
were performed with 10 ng of DNA each using V2 Gen- 
omiPhi kits (GE healthcare, Fairfield CT, USA). WGA 
products were pooled providing -10 \ig of material for 
subsequent genotyping. The reliability of WGA was pre- 
viously demonstrated by Senger et al. [22]. Fishes were 
anesthetized with the addition of phenoxybarbtal in the 
water tank (3ml per 10 liter) and then sacrificed follow- 
ing the rules established by the ethical local committee. 

Marker selection 

Nile tilapia expressed sequences (ESTs and complete 
mRNA sequences) were either downloaded from NCBI 
nucleotide and RBEST (http://reprobio.nibb.ac.jp/) data- 
bases or provided by the CIRAD (Montpellier, France). 
After masking simple repeats using the RepeatMasker 
program [47] sequences were aligned together using the 
CAP3 software [48] with default parameters. The result- 
ing unique sequences were aligned on stickleback (vl.O 
assembly), pufferfish (v8 assembly), medaka (vl.O assem- 
bly) and zebrafish (Zv7 assembly) genome sequences 
using the Exonerate software [49] . Orthologous sequences 
were searched using a minimal score of 250 and an align- 
ment size of 80 to 300 nt. Sequences which showed 
conservation with the highest number of species were 
selected to design markers. For every marker, the coordin- 
ate of the best hit on each of the model genomes was 
considered as the location of the putative ortholog in the 
model species. In addition, BAC end sequences from the 
CIRAD and the University of Maryland (USA) [9] 
containing genes of interest were selected for the RH 
mapping. Microsatellites of the genetic linkage map of 
Nile tilapia (second generation) [15] were also selected in 
order to anchor the genetic map to the RH map. 

In addition SNP markers were identified in a pool of 
20 individuals obtained from the 10 th generation of the 
GIFT population (WorldFish Center, Malaysia). The de- 
velopment of the SNP markers is described in detail 
elsewhere (van Bers et al, in prep). In brief, an RRL was 
prepared and sequenced using the Illumina GAI sequen- 
cing technology. Pooled DNA was digested with the Rsal 
restriction enzyme and fragments of 3.5-4 kb were iso- 
lated by electrophoresis. The fragments were sheared 
and used for high throughput sequencing. Sequence 
reads stringently filtered for quality were first assembled 
using SSAKE [50] to constitute a reference draft se- 
quence. Less stringently filtered reads were subsequently 
mapped onto the reference draft using MAQ [51], allow- 
ing the detection of SNPs. The Minor Allele Frequency 
(MAF) was calculated based on how many times a SNP 
was observed in the sequence data. Only SNPs showing 
the minor allele at least three times were considered as 
true SNPs. Illumina type II SNPs with a design score 
>0.75 and a MAF >0.16 were selected for genotyping. 
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Marker sequences used to construct the map are given 
as supplemental Additional file 6: data S6 and have been 
submitted to NCBI: accession numbers, ss 244316446- 
244316740 (SNP markers) and 253831740-253831804 
and 75611463-75642120 (EST markers). BAC markers 
are deposited in http://www.BouillaBase.org. 

Genotyping 

Genes, microsatellites and BAC end markers were typed 
on the Nile tilapia RH panel using the 1536-marker Illu- 
mina BeadArrays system. Amplification of 45-55 bp loci 
was performed using oligonucleotides 20-23 nt in length 
complementary to the Nile tilapia sequences and designed 
using the Illumina proprietary design program. For this, 
the program was adapted to design oligonucleotides flank- 
ing non-polymorphic sites. Oligonucleotides were synthe- 
sized and spotted onto two 96-sample array matrices by 
the Illumina company (San Diego, CA, USA). The geno- 
typing was carried out using the Illumina GoldenGate 
technology. SNP markers were typed on the RH panel 
with a 384 SNP multiplex genotyping assay using the 
GoldenGate Assay. Oligonucleotides were designed flank- 
ing a SNP according to the Illumina design program spe- 
cifications. The assay was deployed on a BeadXpress 
platform using the Veracode technology. 

The Illumina Genome Studio software was used to 
visualize typing results and score the presence/absence 
of the markers in the hybrid cell lines by a method 
adapted from McKay et al. [52]. In our experiment all 
markers including the SNPs behaved as homozygous 
markers because the hybrid lines were constructed from 
a homozygous clonal Nile tilapia line implying homozy- 
gosity at all loci. Consequently a single allele-specific 
oligonucleotide was used in the GoldenGate extension 
step and only the axis of the corresponding fluoro- 
chrome was taken into account for a given marker. 
Clones located above a threshold of 0.3 on this axis on 
the cartesian plot were scored as present regardless to 
the value on the other axis. Clones located under the 
threshold were scored as absent. The threshold was 
adjusted depending on the overall distribution of dots 
on the typing profile. Clones located close to the thresh- 
old were scored as ambiguous (Figure 6). Microsatellites 
used to characterize hybrid cell lines and additional BAC 
end markers were typed by PCR and scored as described 
in Guyon et al. [21]. 

Data computation 

All vectors were integrated in a single file and a two- 
point analysis was performed using the Multimap v2.0 
software [36] starting at a lod score of 4.0. The multi- 
point analysis was performed with the CarthaGene vl.O 
software [53]. RH groups that harboured obvious aberra- 
tions were re-analysed at higher two-point lod scores of 



up to 7.0 before performing the multipoint analysis 
again. Distances between markers were expressed in 
centirays (cR 350 o). Coordinates of the putative ortholo- 
gous genes in the four model genomes were aligned with 
the corresponding Nile tilapia markers on the graphic 
representation. Ordered RH groups were tentatively 
oriented according to two-point lod scores between their 
end-markers. Similarly, comparison of the lod values 
obtained between the markers at the extremities of two 
RH groups, which are for other reasons supposed to be 
close to one another on the genome allowed to confirm 
or rule out this hypothesis. The microsatellite order on 
the RH map was compared with that of the linkage map 
[15]. Conserved Segments (CS) and Conserved Segments 
Ordered (CSO) [35,36] between Nile tilapia and the 
model species were identified using the AutoGRAPH 
web server [54]. 

CMap construction 

A comparative map viewer was constructed using the 
web-based tool CMap [55] in order to visualize and 
compare the RH map with the genetic map of O. niloti- 
cus [14,15]. Tab-delimited map and correspondence files 
were created between the two maps based on marker 
names and loaded into the CMap database. In addition, 
comparative maps were created between each of the RH 
map, the O. niloticus genetic map, as well as maps of 
two haplochromine cichlid lineages: Astatotilapia bur- 
toni [56] and Metriaclima zebra/Lab eotropheus fuelle- 
borni [57]. The comparative maps can be viewed 
through CMap at http://cichlid.umd.edu/cgi-bin/cmap/ 
viewer. 

Fluorescent In Situ Hybridization (FISH) 

In order to validate the anchoring of the RH groups to a 
particular linkage group and to orient correctly the groups 
on the corresponding chromosome, physical mapping was 
performed by fluorescent in situ hybridization (FISH) 
using as probes at least two BACs per linkage group/ 
chromosome. 

Chromosome preparations 

Chromosomes were prepared by direct in vivo methods 
from spleen and head-kidney cell suspensions as 
described in Fischer et al. [58], with a hypotonic treat- 
ment performed in a 28°C water bath for only 20 min. 
All chromosomes preparations were made from the XX 
genotype of the sequenced strain. 

BAC clones preparation and purification 

BAC DNA were purified from two Nile tilapia BAC 
libraries, T3 library (mean insert size 145 kb) and T4 li- 
brary (mean insert size 194 kb) [59]. Individual clones 
were cultured in 100 ml 2YT broth with 12.5 (ig/ml 
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Figure 6 Cartesian plots of radiation hybrid typing by the GoldenGate technology. (A) Dots located above a threshold of 0.30 on the y-axis 
corresponded to positive clones scored "1", dots located under the threshold corresponded to negative clones scored "0". Dots located close to 
the threshold were considered as ambiguous results scored "2" (grey dots). (B) According to the overall repartition of dots on the profile of 
typing the threshold was lowered to 0.20 on the x-axis. 



chloramphenicol at 37°C for 24 hours. BAC DNA was 
isolated using the plasmid midi kit (Qiagen) following the 
manufacturers protocol, obtaining between 20 to 50 \ig 
yields. The BAC DNA was then validated and its quality 
verified before FISH by PCR using specific primers. 

Fluorescence in situ hybridization 

BAC probes were prepared from 2 \ig of BAC preparation 
fragmented by heating at 98°C for 30 min and subsequently 
labelled with either DIG- or biotin using a High Prime 
DNA labelling kit (Roche Applied Science), according to 
the suppliers protocol. To facilitate double BAC FISH 
experiments, we first prepared stock solutions for each 
component which were stored separately at -20°C. The 
BAC probe pellet was resuspended in the hybridization buf- 
fer (50% formamide, 2xSSC, 10% dextran sulphate and 
50 mM of sodium phosphate) at a concentration of 16 ng/ 
\A and incubated overnight at 37°C under constant agita- 
tion. The competitor consisted of sonicated O. niloticus 
DNA and the carrier was sonicated bovine DNA. Both were 
resuspended in hybridization buffer at 8 |ig/|il and 10 Hg/^il, 
respectively. 

For the FISH, 2.5 \A of BAC probe, 1 \A of competitor 
and 4 \A of carrier were preheated at 45°C in a water bath 
before mixing. The BAC probe mixture was then dena- 
tured for 5 min at 85°C and pre-hybridized in a water bath 



at 37°C for 90 min to eliminate non-specific signals gener- 
ated by small abundant repetitive sequences (essentially 
microsatellites) present in BAC inserts or generated by the 
BAC vector. For the double FISH, both BAC probes were 
pooled, mixed just prior to the hybridization. Chromo- 
somes on slide preparations were denatured for 10 seconds 
in 70% formamide/2x SSC at 72°C, followed by a dehydra- 
tion in 70%, 80%, 98% ethanol bath series. After quicldy air 
drying the slides, the reannealed probe mixture was loaded 
onto the slides, covered with 22 x 22 mm plastic coverslips, 
and hybridized at 37°C in a moist chamber during 48 h. 
For the post-hybridization, the coverslip was removed and 
the slides were washed in 0,4x SSC, 0,3% Tween 20 (v/v) 
at 60°C for 2 min and 2x SSC, 0.1% Tween 20 (v/v) at 
room temperature for 1 min. The hybridized probes were 
detected with 30 \A of a dual colour solution of anti-dig 
Rhodamin/streptavidin-FITC (Roche Diagnostics) placed 
under a 24 x 40 mm coverslip, during 5 min in the dark. 
Slides were then washed three times in 4x SSC, 0.1% 
Tween 20, 2 min each followed by dehydration in a series 
of increasing ethanol percentages. 

For the FISH observations, the slides were mounted in 
DAPI/antifade and analysed with a fluorescent microscope 
Zeiss Axio imager Ml equipped with a CoolSNAP camera 
(Photometries) and the animal karyotyping/FISH imaging 
software Genus (Genetix). 
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Additional files 



Additional file 1. Data SI: List of the 237 gene markers derived from 
contigs made with CAP3 software using fish ESTs from NCBI, RBEST and 
CIRAD as listed under the name of each marker. 

Additional file 2. Data S2: RH vector suite 

Additional file 3. Data S3: Table of RH group characteristics. 

Additional file 4. Data S4: Integrated maps of the 22 chromosomes. 
The legend of Figure 3A applies to each of the 22 chromosome figures. 

Additional file 5. Data S5: Comparative maps of the 22 chromosomes. 
The legend of Figure 5 applies to this file. The order of the columns is 
the same as in Figure 5. 

Additional file 6. Names and nucleotide sequences of markers 
located on the RH map. 
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